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Approximating functions on stratifiable sets 



D. Drusvyatskiy and M. Larsson 



Abstract 

We investigate smooth approximations of functions, with prescribed gradient behavior on a 
distinguished stratifiable subset of the domain. As an apphcation, we outhne how our results 
yield important consequences for a recently introduced class of stochastic processes, called the 
matrix-valued Bessel processes. 



1. Introduction 

Nonsmoothness arises naturally in many problems of mathematical analysis. A conceptually 
simple way to alleviate the inherent difficulties involved is by smoothing. The celebrated 
Approximation Theorem of Whitney — a prototypical result in this direction — shows that any 
continuous function / on R" can be uniformly approximated by a C°°-smooth function g. See 
for example [oj Theorem 10.16]. In light of Whitney's result, it is natural to ask the following 
vague question. Can we, in addition, guarantee that such an approximating function g satisfies 
"useful" properties on a distinguished nonsmooth subset Q of R"? In the current work, we 
consider prototypical non-pathological sets Q, namely the sets Q that are composed of finitely 
many smooth manifolds {Mi} that fit together in a regular pattern. The "useful" property we 
would like to ensure is that the the gradient of the approximating function g at any point in 
Q is tangent to the manifold containing that point, that is the inclusion 

\7g{x) e TxMi holds for any index i and any point x e Af^. 

This requirement can be though of as a kind of Neumann boundary condition. To the best of 
our knowledge, such a question has never been explicitly considered, and therefore we strive 
to make our development self-contained. 

In the current work, we provide an entirely constructive argument showing the existence of a 
continuously differentiable approximating function satisfying the tangential condition, provided 
that the partitioning manifolds yield a Whitney stratification. In particular, our techniques are 
applicable for all semi-algebraic sets — those sets that can be written as a union of finitely 
many sets each defined by finitely many polynomial inequalities. For more details on semi- 
algebraic geometry, see for example [2] [s]. Guaranteeing a higher order of smoothness for 
the approximating function seems fairly difficult, with the curvature of the manifolds entering 
the picture. However we identify a simple and easily checkable, though stringent, condition 
on the Whitney stratification, which we call normal Eatness (Definition |2.6[ ), that bypasses 
such technical difficulties and allows us to guarantee that the approximating function has an 
arbitrary order of smoothness. 

At first sight, the normal flatness condition is deeply tied to polyhedrality. However, we prove 
that this condition satisfies the so-called Transfer Principle investigated for example in 11 
|4j |13[ |10[ |15[ [5j [6] . Consequently this condition holds for a number of important subsets of 
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matrix spaces, and our strongest results become applicable. Finally we apply our techniques to 
the study of a class of stochastic processes called the matrix-valued Bessel processes, introduced 
in [s] . We give an informal outline of how our results constitute a key component needed to 
obtain a good description of the law of the process, and how they enable powerful uniqueness 
results to become applicable. Indeed, this was the original motivation for the current work. 

The outline of the manuscript is as follows. In Section[2] we record basic notation that we will 
use throughout, and state the main results of the paper. Section |3] contains the proofs of the 
main results. In Section[4j we discuss the Transfer Principle and how it relates to stratifications, 
while in Section [5] we outline an application of our results to matrix- valued Bessel processes. 



2. Basic notation and summary of main results 

Throughout, the symbol | • | will denote the standard Euclidean norm on R" and the absolute 
value of a real number, while || • || will denote the Frobenius norm on the space of linear 
operators on R". A function /: Q — > R defined on a set Q C R" is called n-Lipschitz continuous 
on Q, for some k > 0, if the inequality 

\f{x) — f{y)\ < k\x — y\ holds for all x,y £ Q. 

The infimum of k satisfying the inequality above is the Lipscliitz modulus of /, and we denote 
it by lip/. For notational convenience, 1-Lipschitz functions will be called non-expansive. The 
function / is said to be locally Lipschitz if around each point x G Q, there exists a neighborhood 
U so that the restriction of / to C/ is Lipschitz continuous. 

Given any set Q C R" and a mapping f:Q—>^Q, where Q C R™, we say that / is CP-smooth 
if for each point x G Q, there is a neighborhood U oix and a C^-smooth mapping /: R" R™ 
that agrees with / on Q n t/. It is always understood that p lies in {1,2,..., oo}. 

The following definition is standard. 

Definition 2.1 (Smooth manifold). Consider a set M c R". We say that M is a 
manifold (or "embedded submanifold" ) of dimension r if for each point x S M, there is an open 
neighborhood U around x such that M n C/ = i^~^(0) for some C smooth map F:U ^ R"^'', 
with the derivative T)F{x) having full rank. 

The tangent space of a manifold M at a point x G M, will be denoted by T^M, while the 
normal space will be denoted by N^M. Most non-pathological subsets of R", which occur in 
practice, can be decomposed into finitely many manifolds that fit together in a regular pattern. 
The following definition makes this precise. 

Definition 2.2 (Stratifications). A CP- stratification of a set Q C R" is a partition of Q 
into finitely many manifolds, called strata, satisfying the following compatibility condition. 
Frontier condition: For any two strata L and M, the implication 

L n cl M 7^ L C (cl M) \ M, holds. 

A stratification A is said to be a CP-Whitney stratification, provided that the following 
condition holds. 

Whitney condition (a): For any sequence of points Xk in a stratum M converging to 
a point x in a stratum L, if the corresponding normal vectors Vk G N^^M converge to a 
vector V, then the inclusion v S N^L holds. 
It is reassuring to know that most non-pathological sets admit such stratifications: every 
subanalytic set admits a C-Whitney stratification, for any finite p, as does any definable set 
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in an arbitrary o-minimal structure. In particular, this is true for semi- algebraic sets. For a 
discussion, see for example [16 . 

To make our arguments easier to follow, throughout the paper we will assume that given any 
strata L and M of a stratification A, if the inclusion L C (cl M) \ M holds, then the dimension 
of L is strictly smaller than that of M . We should stress that this assumption is for notational 
convenience only. If it did not hold, then we would need to work explicitly with the partial 
order ^ on the strata, defined by 

L<M ^ Ld clM. 

We are now ready to state the main results of this paper. Their proofs are given in Section|3] 

Theorem 2.3 (Approximation on stratified sets). Consider a continuous function f: R" — > 
R and a closed set Q C R". Then for any continuous function e: R" — > (0, oo) and any C^"*"^- 
stratification A of Q, there exists a Frechet differentiable function g: R" — > R satisfying the 
following properties. 

Uniform closeness: The inequality \ f{x) — g{x)\ < e{x) holds for all x e R". 

Boundary condition: For any stratum M G A and any point x G M, the inclusion 

Vg(a;) € T^M holds. 

Lipschitz modulus: The Lipschitz modulus of g is no greater than 13'"+^ hp/, where 
m is the number of strata in A. 

Support: Given a neighborhood V\ of the support of f, we may choose g so that its 
support is contained in Vi . 
If A is a C^'^^-Whitney stratihcation, then we may ensure that g is C^-smooth. 



Remark 2.4. In theorem 2.3 if / already had the desired order of smoothness (say C'^ 
in the case when ^ is a Whitney-stratification), then we may guarantee that g and / agree 
outside of V2, where V2 is an arbitrary neighborhood of the boundary of Q. 



Remark 2.5. The hypothesis on e can be weakened to lower semicontinuous. Indeed, it is 
easy to show that any strictly positive, lower semicontinuous function on R" can be minorizcd 
by a strictly positive continuous function. 



Observe that even under the Whitney condition (a), the approximating function g, guaran- 



teed to exist by Theorem 2.3 is only C^-smooth. To guarantee a higher order of smoothness, 
it seems that one needs to impose stronger conditions, both on the stratification and on the 
curvature of the strata. Below we identify a simple, though stringent, condition which bypasses 
such technical difficulties. 

Given a set Q C R" and a point x E Q, the distance of x to Q is 

d{x, Q) := inf |a: - y|, 

and the projection of x onto Q is 

Pqi.^) ■.= {yeQ.\x-y\ = d{x,Q)}. 

Definition 2.6 (Normally fiat stratification). A finite -stratification of a subset Q of 
R" is said to be normally flat if for any two strata L, M £ A with L £ (cl Af) \ M, there are 
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neighborhoods V of L and U of Af so that equahty 

Pl (x) ^ Plo Pm {x) holds for all x£Vf\U. 



The meaning of the normal flatness condition is that projecting onto L is the same as 
first projecting onto M and then onto L, at least on some set close to L where Pm is well 
behaved. This condition is of course quite strong; nonetheless it is satisfied in a number 
of situations. Section [4] contains a more detailed analysis, which in particular shows that 
polyhedral sets and spectral lifts of polyhedra, having natural symmetry properties, admit 
normally flat stratiflcations (Proposition 



4.4 



and Theorem 4.8). 



We should note that normally flat stratification are precisely the stratifications that admit 
control data, in the sense of Thom-Mather theory, where the "tubular neighborhoods" arise 



specifically from normal bundles. For a discussion of control data, see the notes 14 



The next result is a strengthened version of Theorem 2.3 under the normal flatness condition 



Theorem 2.7 (Approximation on normally flat stratiflcations). Let Q, A, f and e be as in 
Theorem \2.3l and suppose A is in addition normally Bat. Then there is a function g : R" — > R 
that satisfies all the properties of Theorem \2.3\ as well as the following. 

Smoothness: g is smooth. 

Boundary behavior: For any stratum M £ A, any point x G M, and any normal 
direction v £ N^M , the function R 9 1 1-> g{x + tv) is constant in a neighborhood of the 
origin. 



In particular, Theorem |2.7| directly implies that for any stratum M, any point x £ M, and 
any normal vector v £ N^M , we have 



D'^'^^ g{x)[v, . . . ,v] = 0, for any k — 1,. . 



where D^''^ g(x)[v, . . . ,v] is the fe'th order directional derivative of g at a; in direction v. This is 



a significant strengthening of the conclusion in Theorem 2.3 



3. The iterative construction and proofs of main results 



Our method for proving results such as Theorems 2.3 and 2.7 relies on an iterative procedure, 
where the original function / (possibly after some "pre-processing") is first modified on the 
zero-dimensional strata, then on the one-dimensional ones, and so on. After step k, the resulting 
function approximates /, and has the desired properties on all strata of dimension k or less. 
When k = n the desired properties hold on all of Q. In this section we give a detailed description 
of this iterative construction, in particular the induction step going from fc to A; -|- 1. The reason 
for bringing out some of the details, as opposed to hiding them inside proofs, is that the same 
general approach sometimes can be used in specific situations to obtain further properties of 
the approximating function. An example of this will be discussed in Section [5j 

The induction step has two crucial ingredients, namely (1) finding a suitable tubular 
neighborhood of the stratum where the current function is to be modified, and (2) an 
interpolation function constructed using this tubular neighborhood. These two objects are 
described in Subsections |3.1| and |3.2[ Then, in Subsection |3.3| the induction step is described 



in detail, laying the groundwork for the proofs of the main results, given in Subsection |3.4[ 
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3.1. The tubular neighborhood 

In this section, we follow the notation of [9| Section 10]. We should stress that this section 
does not really contain any new results. Its only purpose is to record a number of observations 
needed in the latter parts of the paper. 

For a C^"*"^ manifold M C R", the normal bundle of M, denoted by NM, is the set 

NM := {{x,v) e R" X R" : a; G M,t; G N^M}. 

It is well-known that the normal bundle NAI is itself a manifold. Consider the mapping 
E : NM R" defined by 

E{y,v) = y + v. 

A tubular neighborhood t/ of M is by definition a diffeomorphic image under E of an open 
subset V C NM having the form 

V^{iy,v)eNM ■.\v\<S{y)}, (3.1) 

for some strictly positive continuous function S: M R. It is standard that tubular neighbor- 
hoods of M always exist. See for example [9^, Theorem 10.19]. Furthermore there always exists 
a tubular neighborhood on which the projection mapping Pm is C^-smooth. Then for any 
point X lying in M, the derivative of the projection DPm{x) is simply the metric projection 



onto the tangent space T^M. See for example 12 Lemma 2.1] 



The following observation shows that we may squeeze the tubular neighborhood inside any 
given open set containing the manifold. 



Lemma 3.1 (Existence of tubular neighborhood). Consider a C^^^ manifold M C R", as 
well as an arbitrary neighborhood V of M. Then there exists a tubular neighborhood 

Us = {y + ve'R" -.yeM, ve NyM, \v\ < S{y)} 

satisfying Ug C V. Furthermore for any real n > 0, we may ensure that 6 is n-Lipschitz. 



Proof. By intersecting V with some tubular neighborhood of M, we may assume that V is 
the diffeomorphic image of some neighborhood of the zero-section M x {0} in NAI. Now, for 
any point y £ M, define 

S{y) ■.^snp{0<e<l:V,{y)cV}, 

where 

V,{y) := {2 -I- u G R" : z G M, v £ N^M, \y - z\ < e, \v\ < e}. 

Clearly S is strictly positive and the inclusion Us C V holds. We now show that S is 
nonexpansive. To see this, first note that whenever the inequality |a; — yj > S(x) holds, we have 
S{x) — 6{y) < \x — y\ trivially. On the other hand, if we have |a:: — ?/| < S{x), an application of 
the triangle inequality shows that the inclusion Ve{y) C Vs(^x){x) is vahd for e = S{x) — \x — y\. 
We deduce S{y) > e, and hence the inequality S{x) — S{y) < \x — y\ is also valid in this case. 
Interchanging the roles of x and y gives the non-expansive property. Finally, replacing S with 
kS, if need be, ensures that S is K-Lipschitz. □ 

In fact in Lemma [3T] we may ensure that S is C°° smooth. To see this, we need the following 
result, which has classical roots. 



Proposition 3.2 (Approximation of Lipschitz functions). Consider any C^+^ manifold 
M C R" and a function f: M — )■ R that is Lipschitz continuous on M . Then for any continuous 
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function e: M ^ (0, oo) and a real r > 0, there exists a Lipschitz continuous, C°° -smooth 
function f:M^R satisfying 

\f{x) - f{x)\ < e{x) for aU x € M, 

with lip / < lip / + r. 

Proof. The proof proceeds by extending the functions / and e to an open neighborhood 
of M, using standard approximation techniques on this neighborhood, and then restricting 
the approximating function back to M. To this end, for any real e > there exists an open 
neighborhood U of M so that the projection Pm is (1 + e)-Lipschitz on U. Consider now 
the functions /: [/ R and £:[/—>• (0, oo), defined by f{x) = fiPnix)) and e{x) = e{PM{x))- 
Observe that / agrees with f on M and e agrees with e on M , and furthermore the inequality 
lip / < (1 + e)lip/ holds. It is standard then that for any r (see for example [l] Theorem 1]) 
there exists a C°° function f on U satisfying 

\f{x) - f{x)\ < e{x) for ah x e U, 

with lip / < lip / + r. In particular, we deduce lip / < lip / + clip f + r. Since e and r can be 
chosen to be arbitrarily small, restricting f to M yields the result. □ 



Coming back to Lemma 3.1 an application of Proposition 3.2 with f — and e ~ j5 shows 
that there is no loss of generality in assuming that S is C°°-smooth and non-expansive. For 
ease of reference, we now record a version of Lemma |3.1[ where we impose a number of open 



conditions on the tubular neighborhood, which will be important for our latter development. 



Corollary 3.3. Consider a Cp+^ manifold M C R", continuous functions e : R" — ^ 
(0, oo) and f: R" R, a neighborhood V of M, and the closed set T := (clM) \ M. Then 
there exists a tubular neighborhood 

Us = {y + veW -.ye M, v e NyM, \v\ < Siy)} , 

with Us C V O r^, and satisfying 

(i) The width function 5:M ^ (0,oo) is C°°-smooth and non-expansive, 

(ii) The metric projection Pm is -smooth on Us, 
and for each x (z Us we have 

(iii) \fix)-foPMix)\<six), 

(iv) \x-PM{x)\<dix,rf, 

(v) \\DPm{x) - BPM{y)\\ < diy,T), where y = Pm{x), 

(vi) \\BPm{x)\\ < 2. 



3.2. The interpolation function 



Given a tubular neighborhood Us as in Corollary 3.3 (constructed based on a C^^-^ manifold 
AI, a neighborhood V, functions e and /, and the closed set P — (clM) \ M), define a function 
(j) : R" — > R by setting 



x^Us 



where ip : [0,oo) — > [0, 1] is a C°°-smooth function that is 1 on [0, 1/4], vanishes on [3/4, oo), 
and whose first derivative is bounded by 7/3 in absolute value. See Figure[l]for an illustration. 
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Consider also the "annulus" around M, defined by 

Uo = l^xeUs:^6o Pm{x) < \x - Pm{x)\ < h o Pm(x)^ . 

We refer to the function cf) as the interpolation function associated with Us, and its decisive 
properties are cohected in the fohowing lemma. 



1 




Figure 1. The function ^. 



Lemma 3.4 (Properties of the interpolation function). The interpolation function cf) satisfies 
the following properties. 

(i) 0=1 on {xeUs:\x~ Pm{x)\ < ^JoPm{x)}, 

(ii) (b^Qon{xeUs:\x- Pm{x)\ > fS o Pm{x)} and on Ug, 

(iii) (f) is smooth on V^, and for all x G Uq, we have 

~Pm{x) V{SoPM)ix) 



W(j){x) = tij'{t) 



\x~ Pm{xW 



6oPm{x) 



where t 



\x-Pm{x)\ 
SoPm{x) ■ 

(iv) For all x e Us the inequality, \x — PMix)\ \V4>{x)\ < 8, holds. 



Proof. The first three properties follow directly from the definition, together with a 
calculation to compute the gradient. The only point that is somewhat delicate here is to verify 
the equality 

X ~ Pm{x) 



for X € Us\ M. To this end, an application of the chain rule yields the equation 

{u-DPm{x)[u],x- Pm{x)} 



(3.2) 



M 



{x),u) = 



\x-Pm{x)\ 



for any vector u G R". This together with the inclusions DPm{x)[u] E Tp^j(^^-^M and x~ 
Pm{x) e Npj^(^^)M, establishes ^?2\ . 

For the last property, first observe V0 = outside Uq. Hence it is sufficient to consider only 
points x G Uq. For such points, observe that the inequalities, \til)' {t)\ < \ and \x — Pm{x)\ < 
S°Pm{x), hold. Since S is non-expansive, and Corollary 3.3 yields the bound ||DPm|| < 2, 
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we readily obtain the inequality |V((5 o Pm)\ < 2. The claim now follows from the expression 
for V0(a;). □ 

3.3. The induction step 

In this subsection we fix the following objects: 

- A, a C^+^-stratification of some closed set Q C R". We emphasize that A is not assumed 
to have the Whitney property. 

- M £ A, a. stratum. 

- r = (cl M) \ M, a closed set. Whenever F is nonempty, it is a union of submanifolds in 
A, each of whose dimension is strictly less than that of M. 

- £ : R" — >■ (0, oo), a continuous function. 

- / : R" — >■ R, a locally Lipschitz, Frechet differentiable function. We emphasize that the 
gradient V/ is not assumed to be continuous. 

- Us, a tubular neighborhood of M. It is obtained by applying Corollary |3.3| with the 
functions e, /, and F as above, and V taken to be a neighborhood of M disjoint from 
all other submanifolds of dimension less than or equal to that of M. 

- (j), the interpolation function corresponding to Us- 

Since F is a disjoint union of submanifolds in A, we can unambiguously let T^T and N^T, for 
cc € F, denote the tangent and normal spaces of the submanifold where x lies. Note that we have 
so far made no assumptions on the geometry of Q beyond the existence of a CP''"^-stratification. 



Proposition 3.5 (Continuously differentiable approximation). Assume that the inclusion 
V/(a;) e TxT holds for every x GT, and define 

g = (t>foPM + {l~^)f. 

Then g satisfies the following properties. 

- g is locally Lipschitz and Frechet differentiable. 

- Vg{x) = Vf{x) for all x G F, and Vg{x) e T^M for all xe M. 

- \f{x) - g{x)\ < e{x) for all x e R". 

- lip 5 < 12 lip/. 

If A is a C^^^ - Whitney stratification and f is -smooth, then g is C^-smooth as well. 



Proof It is convenient to split the proof into a number of steps. 

Step 1 (differentiability) . We show that g is Frechet differentiable at each a; e F with Vg(x) = 
V/(a:). This is sufficient to establish the claim, since clearly g is Frechet differentiable on F'^. 
To this end, let Vk be any sequence of unit vectors in R" converging to some vector v, and 
let tk be a sequence of positive reals converging to 0. We set ■= x + ifeW^, and assuming 
without loss of generality Xk G Ug for each k, we define yk ■= Pmixk)- Observe g{x) = f{x), 
and consequently we have 

g{xk)~g{x) fjxk) - fix) fjyk) - fjxk) , . 
7 = 7 +(l){xk) • (3.3) 

tk tk tk 

The first term on the right-hand-side converges to (V/(a;), v). For the second term, the Mean 
Value Theorem applied to the function 1 f{xk + t{yk — Xk)) shows that there exists a vector 
Zk, lying on the line segment joining and y^, satisfying f{yk) - f{xk) = {'^f{zk),yk - Xk). 
Hence we obtain 

< |V/(z,)|fc^ < \Vfizk)\tk, 
tk tk 
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where the last inequahty foUows from Corollary 3.3 and the inequality \yk — Xk\ < d{xk,T)^ < 



\xk — = t^. Since V/ is locally bounded due to local Lipschitzness of /, we see that the 



second term on the right-hand-side of (3.3 1 converges to zero, thereby establishing the claim. 

Step 2 (closeness, boundary condition, Lipschitz modulus). The closeness of 5 to / follows 
from the simple calculation 

|/(x) - gix)\ = ^{x)\fix) - f o Pm{x)\ < s{x), 



where the last inequality follows from Corollary |3.3| Next, we verify the boundary condition. 
For any x G M and v G N^M we have 17(2: + tv) — f{x) for all t close to zero, since the equality 
0=1 holds in a whole neighborhood of x. It follows that the directional derivative of g in the 
direction v vanishes at x, and hence we deduce \/g(x) e T^M as desired. Finally, we prove that 
g is locally Lipschitz and derive a bound on the Lipschitz modulus of g. For x £ F"^, we have 

\Vg{x)\ < 2|V/(x)| + 0(x)|V(/ o Pm){x)\ + |V0(a;)| \f o Pm{x) - 



Due to Corollary |3.3[ we have \V{f o PM)ix)\ < 2|V/(PA/(a;))| for x e Us. Moreover, by the 
Mean Value Theorem there exists a point z on the line segment joining x and Pm{x), satisfying 
f{x) — f{PM{x)) — {\/f{z),x — Pm{x)). Consequently we deduce 

|V0(x)| \foPM{x)-f{x)\ < |V0(x)| \x^PMixWf{z)\. 

Using Lemma |3.4[ we immediately deduce that g is locally Lipschitz continuous. Similarly, one 
can easily verify the inequality 

\Vg{x)\ < 12 hp/, 

as required. 

Assume now that A is a. CP+^-Whitney stratification and / is C^-smooth. 
Step 3 (continuity of the gradient). We now show that the gradient mapping is continuous. 
It is enough to check continuity at each point x £ T. Note that for a; G F^, we have 

Vgix) - Vf{x) = Vc^{x)U o Pm{x) - !{x)) + 0(x)(V(/ o Pm){x) - Vf{x)). 

Hence, it suffices to show that the left-hand-side converges to zero as x tends to x with x ^Us. 
We do this term by term. To deal with the first term on the right-hand-side, we define y := 
Pm{x) and apply the Mean Value Theorem to obtain a vector z, lying on the line segment 
joining x and y, satisfying f{y) — f{x) = (V/(2), y — x). Assuming without loss of generality 
X 7^ we obtain 

\f{y)-f{x)\^\y-x\{yf(,z),v), 

where we define v := j^E^ G NyM. Whenever v converges to some vector v, we have v € 
NrcT by the Whitney condition. Consequently along any convergent subsequence, we obtain 
{Vf{z),v) {\/f{x),v) = 0, since V/ is continuous and Vf{x) lies in T^F. Finally recalling 



from Lemma 3.4 that the quantity |V(/()(a;)||y — a;| is bounded, we deduce that the first term on 
the right-hand-side tends to zero. 

To complete the proof of Step 3, it suffices to show |V(/ o Pm){x) — V/(a;)| -> as a; a; 
with a; £ C/a. To this end, first note that for any w E R", we have 

(V(/oPm)(x),w;) = {Vfiy),BPMix)w), 

and consequently we obtain 

(V(/ o PmKx) - V/(a:), w) = {Vf{x),BPMiy)w - w) 

+ (V/(y)-V/(a:),DPM(y)i«) 



+ {\/f{y),r>PM{x)w - DPM{y)w}. 
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Observe that the inclusion DPM{y)w — w G NyM holds, and so any limit point of such vectors 
hes in N^T. Hence along every convergent subsequence, the first term vanishes. The second and 
third terms vanish due to the continuity of V/, boundedness of ||DPm(?/)||i a-nd the inequahty 



||DP/\/(a;) — DP/\/(y)|| < d{y, T) established in Corollary 3.3 This concludes the proof of Step 3. 

□ 

If we assume that the stratification A is normally flat, stronger approximation results can 
be obtained. First we record the following observation, which is a direct consequence of the 
finiteness of the number of strata and the defining property of normally fiat stratifications. 



Lemma 3.6. Consider a normally Hat stratification A of a set Q C R" and a stratum 
M . Then there exists a neighborhood U of M such that for each stratum L G A, with L C 
(cl A'f) \ M, there is a neighborhood V of L with Pl{x) ^ Pl o Pm{x) for ah x eU r\V . 



Proposition 3.7 (Arbitrarily smooth approximation). S uwo se that the stratification A 



is normally flat, and that Us is contained in U from Lemma 
L E A with L C (cl M) \ M has a neighborhood V with 



3.6 



Assume that each stratum 



f{x) = /oPl (x) for all x e V. 

Define a function g — (/)/ o P^/ + (1 — 0)/. Then each stratum L G A with L C clM has a 
neighborhood W so that the equality 

g(x) = g o Pl{x) holds for all x E W. 



Proof. We start with the case L — M. To this end, let W be the subset of Us where the 
equality (p = 1 holds. Clearly is a neighborhood of M, and for any point x G W, we have 

g{x) = f o Pm{x) = f oPm° Pm{x) = go Pm{x), 

as required. Consider now a stratum L E A with L C (clA/) \ M . By the standing hypothesis, 
there is a neighborhood V" of L on which the equality f — f o Pl holds. Now pick a new 
neighborhood W oi L satisfying 

-wcv, 

- Pm{x) e V for allxeWn Us, 

- Pl° Pm{x) = Pl{x) for aWxGWC] Us. 

To see that such a neighborhood can be found, first note that it is easy to find W satisfying the 
first two properties, thanks to the continuity of Pm on Us'- simply intersect V with a sufficiently 
small open set containing L. We then replace W by the smaller set n where V is as in 
Lemma |3]6j The resulting set, which we again denote by W , satisfies all three properties. 

We now verify the equality g = g o Pj^ on W. First consider a point x G W \ Us. Recall we 
have (j) = outside Us and / = / o Pj^ on W, and hence we obtain 

g{x) = f{x) = foPLix)=goPL{x). 

Now consider instead a point x E W OUs. The inclusion Pm{x) G V, the equality / = / o Pl 
on V, and the normal flatness condition yield 

/ o Pm(x) = / o Pi o PMix) - / o Pl{x). 

Consequently, we deduce 



g{x) = (j>{x)f o Pm{x) + (1 - (t>{x))f{x) = f o Pl{x). 
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Finally observe that the right-hand-side equals goP]^[x), since we have 4'{Pl{x)) = 0. This 
establishes the claim. □ 

We end this subsection by recording the following remark. 

Remark 3.8. In the notation of Proposition |3.7[ observe that since the equation = 
holds on F = (clAf) \ Af, we have equality g — f there. Consequently for any stratum L 
contained in F, there exists a neighborhood V of L, so that for all x £ V we have 

Hence if / is C^-smooth and the restriction of (p to F^ is C^-smooth, then g is C-smooth as 
well. 



3.4. Proofs of main results 

Given the previous developments, the proofs of our main results are straightforward. 



Proof of Theorem 2.3 If / is not already C^, we replace it by a C°°-smooth function /o 
that differs from / by at most e/{m + 1), where m is the number of strata, and whose Fipschitz 
modulus is almost as small as that of /. 

Now, let M be a zero-dimensional stratum in A, and note that F :— (clAf) \ M is empty. 



Therefore the hypothesis of Proposition 3.5 (with /o in place of /) is vacuously true, and we 
may apply it with e/(m -I- 1) to obtain a new function /oi. Repeat this process for all the zero- 
dimensional strata to get a function /i that is Frechet differentiable and satisfies V/i(a;) G T^L 
for every point x lying in some zero-dimensional stratum L € A. Hence, if we choose a one- 



dimensional stratum M £ A, we may again apply Proposition 3.5 to get a function /n that 
again satisfies V/ii(a;) S T^L for every point x lying in some zero-dimensional stratum L € A, 
as well as V/ii(a:) G T^M for any x G M. By iterating this process m times, it is clear that 
the resulting function g is locally Lipschitz continuous, Frechet differentiable, and satisfies the 
uniform closeness and boundary properties. Moreover, in each step the Lipschitz modulus is 
increased by at most a factor of 12. Taking into account the initial smoothing of /, we get the 
stated bound on lip g. 

The support property is easily achieved by, in each step above, letting the tubular 
neighborhood Ug be contained in Vi. Similarly, to see that the claim in Remark |2.4| is valid, 
simply let Us be contained in V2 as well (and take fo — f in the beginning of the proof.) 

In case A has the Whitney property, the smoothness of /o is preserved after each 



application of Proposition |3.5[ and this yields the final assertion of Theorem 2.3 



Proof of Theorem \2.7\ The result follows exactly as in the proof of Theorem |2.3[ except that 
in each step we additionally apply Proposition |3.7[ This can be done, provided we shrink Us if 



necessary. Note that the smoothness carries over in each step by Remark 3.8 The function g 
obtained in the end has the property that every stratum M G A has a neighborhood on which 
we have g — g o Pj^f^ and this immediately yields the claimed boundary behavior. 



4. Normally flat stratifications 

In this section we discuss important examples of sets admitting normally flat stratifications. 
First, however, we observe that this notion is strictly stronger than the Whitney condition. 
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Proposition 4.1 (Normally flat stratifications are Whitney). If a stratification of a 
set Q C R" is normally flat, then for any strata L and M, with L C (el M) \ M, there exists 
a neighborhood V of L so that the inclusion 

N^M C Np^(^)L holds for all x e Mn V. 

Consequently normally flat stratifications satisfy the Whitney condition (a). 

Proof Since the stratification is normally flat, there exist neighborhoods V oi L and U of 
M so that the equality 

Pl{x) ^ Plo Pm{x) holds for aU x^VC^U. 

Consider a point x G A/ H V , and let v € N^M be an arbitrary normal vector. Define y = 
X + tv for some t > that is sufficiently small to guarantee the inclusion y £ U. Then we have 
X — Pm (y) , and consequently Pl (x) = Pl o Pm (y) — P^ (y) . We deduce 

tv^y-x^{y- PLiy)) + [Pdx) -x)£ Np^^^-jL, 

as we had to show. The claim that normally flat stratiflcations satisfy the Whitney condition 
is now immediate. □ 

The following simple example shows that the normal flatness condition is indeed strictly 
stronger than the Whitney condition. 

Example 4.2 (A Whitney stratiflcation that is not normally flat). Consider the set 

Q ^ {{x,y,z) eR^ : z ^ xy,x> 0}, 

together with the stratification A = {Mi, M2}, where the strata are defined by 

Ml = {0} X R X {0} and M2 = {{x, y, z) e Q : x > 0}. 

Clearly A is a Whitney stratification. On the other hand, this stratification is not normally 
flat. To see this, first note the equivalence 

PMAx,y,z) = (0,0,0) = 0. 

On the other hand, ^^2(2;, 0,z) is a singleton and has a non-zero j/-coordinate whenever we 
have a; > and z ^ 0. 

To record our first examples of normally flat stratifications, we need the following standard 
result. 

Lemma 4.3 (Ordered afflne subspaces). Consider afhne subspaces L and M in R", 
satisfying the inclusion L G M . Then the equality 

Pl{x)^Plo Pm {x) holds for ah a; G R" . 

Proof. Consider an arbitrary point x G R". Observe that for any point y £ L, we have 
X - Pm (x) G Np^^ (^) M and y - Pm (x) G (^) M. 
Consequently any two such vectors are orthogonal and we obtain 

\x - = |x - Pa/(x)|2 + |Pm(x) - > |a; - Pm{x)\^ + \Pm{x) - PLiPM{x))\'' 

= \x-Pl{Pm{x))\^- 
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Since y (z L is arbitrary, we deduce Pl{x) ~ Pl ° Pm{x), as claimed. □ 
We denote the affine hull of any convex set Q C R" by aff Q. 

Proposition 4.4 (Polyhedral stratifications are normally flat). Consider a stratification of 
a set Q C R", wliere each stratum is an open polyhedron. Then the stratification is normally 
flat. 

Proof. Consider strata L and M, with L C (cl M) \ M . Clearly there exists a neighborhood 
V of M and U oi L satisfying 

Pl = PaSL on V and Pm = -Paff a/ on U. 

Furthermore observe that the inclusion, affL C aff(clM) = aSM, holds. The result is now 
immediate from Lemma 14.31 □ 

It turns out that normally flat stratifications satisfy the so-called Transfer Principle, which 
we will describe below. This realization will show that many common subsets of matrices admit 
normally flat stratifications. We first introduce some notation. 

- M"^™ is the Euclidean space of all n x to real matrices, endowed with the trace inner 
product and Frobcnius norm. For notational convenience, throughout we will assume n < 

TO. 

- S" is the subspace (when m = n) of all symmetric matrices. 

- R^ is the set {(xi, . . . , a;„) e R" : a;i > 2:2 > . . . > x„}, and R*;: > R'^ n R'^. 

- A : S" — )■ R" is the map taking X to its vector {Xi(X), . . . , A„(X)) of (real) eigenvalues, 
in decreasing order. 

- a : M"^™ — >■ R" is the map taking X to its vector {<Ji{X), . . . , (T„(X)) of singular values, 
in decreasing order. (Recall throughout we are assuming n < m,.) 

- Diago; S M"^™, for a vector x E R", is a matrix that has entries all zero, except for its 
principle diagonal, which contains the entries of x. 

For notational convenience, sets Q C R" that are invariant under any permutation of 
coordinates will be called permutation-invariant, while sets that are invariant under any 
coordinate-wise change of sign and permutation of coordinates will be called absolutely 
permutation-invariant. 

Let 0{n) be the group of n x n orthogonal matrices. We can define an action of 0{n) on the 
space of symmetric matrices S" by declaring 

U.X = UXU^ for all U £ 0{n) and X in S". 

We say that a subset of S" is spectral if it is invariant under the action of 0{n). Equivalently, 
a subset of S" is spectral if and only if it can be represented as X~^{Q), for some permutation- 
invariant set Q C R". Due to this invariance of Q, the spectral set can be written simply as 
the union of orbits 

A-'(g)= U 0(n).(Diagx) 

xeQ 

Similarly, we can consider the Cartesian product 0{n) x 0{m), which we denote by 0{n, to), 
and its action on the space M"^™ defined by 

{U, V).X = UXV^ for ah {U, V) £ 0(n, m) and X in M"^". 

We say that subset of M"^"' is spectral if it is invariant under the action of 0{n,m). 
Equivalently, a subset of M"^™ is spectral if and only if it has the form <j^^{Q), for some 
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absolutely permutation-invariant set Q C R". In this situation, the spectral set is simply the 
union of orbits, 

Q = [J 0(n, m).(Diaga;) 

The mappings a and A have nice geometric properties, but are very badly behaved as far 
as, for example, differentiability is concerned. However such difficulties are alleviated by the 
invariance assumptions on Q. The Transfer Principle asserts that many geometric (or more 
generally Variational Analytic) properties of the model sets Q are inherited by the spectral sets 
'^~^{Q) and X~^{Q). The collection of properties known to satisfy this principle is impressive: 
convexity prox-regularity [4], Clarke-regularity 13 11 , smoothness 11 10 |15[ [H], 



partial smoothness (gI, and algebraicity In this section, we will add the existence of normally 
flat stratifications to this list. 

The following crucial lemma asserts that projections interact well with the singular value 
map (or eigenvalue map in the symmetric case). This result may be found in \12\ Theorem 
A.l, Theorem A. 2], and in [4j Proposition 2.3] for the symmetric case. For completeness, we 
record a short proof, adapted from |4, Proposition 2.3], below. 

Lemma 4.5 (Projection onto spectral sets). Let Q C R" be an absolutely permutation- 
invariant set and consider a matrix X € M"^™ with singular value decomposition X = 
f/Diagcr(X) V^. Then the inclusion 

Pa-HQ)iX)D{UmagzV^ -.zePQiaiX))} holds. (4.1) 

Similarly if Q C R" is only permutation-invariant and X £ S" has an eigenvalue decomposition 
X = UDiagX{X) , then the analogous inclusion 

Px-HQ){X) D {[/Diagz -.ze Pq{X{X))} holds. 

Proof. The proof relies on the fact that the singular-value map a is non-expansive, see 
Theorem 7.4.51 in [7j. Then for arbitrary Y £ Q and z G Pq(ct(X)), we obtain 

\\X-Y\\>\a{X)-a{Y)\>\a{X)-zl 

where the second inequality follows from the equivalence (j{Y) e Q <^=> Y G a~^{Q). On the 
other hand, the matrix Y = C/Diagz V'^ achieves equality and lies in ct^^(Q), and this yields 
the claim. The symmetric case follows in the same way since the eigenvalue map A is also 
non-expansive. □ 



Remark 4.6. Lemma 4.5 shows that in the particular case when P„-i(^qj{X) is a singleton. 



so is PQ{a{X)), and we have equality in (4.1 1. We furthermore have the appealing formula 

o P<j-HQ) = Pq°o- (or a o Px-^Q) = Pq° A), 
which captures the gist of the result. 



The proof of the following simple lemma may be found in 12 Lemmas A.l, A. 2], though 
the statement of the result is somewhat different. 

Lemma 4.7 (Permutations of projected points). Consider an absolutely permutation- 
invariant set Q C R" and a point x S R" > . Then for any point y lying in Pq {x) , there exists a 
signed permutation matrix A on R" so that Ay lies in IV^ > n Pg (x). The analogous statement 
holds in the symmetric case. 
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Theorem 4.8 (Lifts of stratifications). Consider a partition A of a set Q C R" into finitely 
many -submanifolds tfiat are absolutely permutation-invariant. Assume furthermore that for 
each stratum M, the spectral set a^^{M) is a CP-manifold. Then if A is a stratification of Q, 
then the family 

a-^{A) {a-\M) : M £ A] , 

is a stratification of a~^{Q). The analogous statement holds for both Whitney and normally 
flat stratifications. The case of symmetric matrices is analogous as well. 

Proof. Throughout the proof, we let L and M be arbitrary strata in A. We first claim that 
the implication 

L<Zc\M =^ c7-\L) Ccla-'^iM), (4.2) 

holds. Indeed consider a matrix X G a~^{L) and a singular value decomposition X = 
U Dia,ga{X) . Then there exists a sequence Xi — t- a{X) in M. Observe that since AI is 
absolutely permutation-invariant, the matrices Xi :— UD'mgXi V'^ lie in cr~^(M) and converge 
to X. This establishes the validity of the implication. 

Assume now that ^ is a stratification of Q. Clearly a~^{A) is a partition of <j^^{Q). Now 
suppose a~^{L) n clo-^^(il/) ^ 0. Then by continuity of cr, we have L n cl Af ^ 0. We deduce 



that the inclusion LcclM holds. Then using (4.2), we obtain a^^{L) C c\a~^{M), thus 
verifying that a^^{A) is a stratification of cr^^(Q). 

Assume now that in addition, ^ is a Whitney stratification of Q. Consider a point X E M. 



Then by 13 Theorem 7.1], we have the remarkable formula 



Nx{a~\M)) = {U(DiagN„i^x)M)V^ : Umaga{X)V^ = X}. 

Verification of the fact that a'''^{A) is a Whitney stratification of (J~^{Q) is now trivial. 

Assume now that ^ is a normally flat stratification. We will show that a~^{A) is a normally 
flat stratification of (t^^{Q). To this end suppose that the inclusion cr^^(L) C c\a~^{M) holds. 
Then there exist neighborhoods V of L and U of M, with Pl = Pl ° Pm onV C^U . Define the 
neighborhoods 

U = (j-'^{U) and V = a-^{V), 

of M and L, respectively. Shrinking U and V ^ we may suppose that the maps Pl, Pm, and the 
composition Pl o P^ are all well-defined and single- valued onV OU. Fix a matrix X £ U n V, 



4.7 



with singular value decomposition X ~ U Diaga{X) V'^ . Applying Lemma 4.5 and Lemma 
successively, we deduce 

Pa-^iL)°Pa-HM){X) = P,-i (i) (C/ Diag Pm (a(X)) V^) 
= UJ}mg[PLoPM{a{X))] 
= UBmgPL{<j{X)) 
= P.-i(i)(X), 

as claimed. The proof of the proposition in the case of symmetric matrices is similar. We leave 
the details to the reader. □ 



Remark 4.9. In Theorem |4.8[ we assumed that for each absolutely permutation- invariant 
CP stratum M, the lift a~^{M) is itself a manifold. In other words, we had assumed as 
part of the theorem that the property of being a C manifold satisfies the Transfer Principle. 
This is known to be the case in the symmetric case [6] Theorem 4.21] for p £ {2, oo}. On the 
other hand, it is unknown whether this Transfer Principle holds for nonsymmetric matrices. 
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While we conjecture this to be the case, trying to prove such a resuh would take us far off 
field, and hence we do not pursue it further. 



5. Application: matrices with nonnegative determinant 
In this section we focus on the set 

^nxn ._ 1^ g j^nxn ■ det X > O} , 

where M"^" is endowed with the trace product and the Frobenius norm. The reasons for 
studying this set are two-fold: firstly, it was one of the original motivations for developing 
the results in this paper. This is due to its role in a certain application involving stochastic 
processes, which we outline below. Secondly, and more importantly in the context of the present 
paper, it allows us to illustrate how the components of the iterative construction in Section^ ( 
be used more generally. Specifically, our goal is to establish an improved version of Theorem 
discussed below. 

We first need a few preliminaries. We begin by observing that Propositions |4.4| and 
Theorem 14.81 show that the collection of C°° manifolds 



Mk := {X G M"""" : rkX = fc} for fc = 0, . . . , n, 
is a normally flat stratification of M"^". Consequently, the collection 

A+ := {Mo, . . . , M„_i, M„+}, 

where M„_|_ is the open set 

M„+ := {X e M"^" : detX > 0}, 

is a normally flat stratification of M"^". 

For a point x £ R", define a new point G R" by setting x^ — if Xi ^ 0, and = 
otherwise. Then if a matrix X G M"^" has a decomposition X = [/Diagx with U,V £ 
0{n), the Moore-Penrose inverse of X is given by X+ := y Diaga;"'" . The transpose of the 
Moore-Penrose inverse, which we denote by 

X^ := {X+y ^ (^^)+, 

will play a role in our development. The main result of this section is the following. 



2.7 



2.7 



Theorem 5.1 (Approximation on matrices with positive determinant). Consider the set 
of matrices Q = M"^", along with the stratification A+, and let f and e be as in Theorem 
Then there is a function g : R" — >■ R that satisfies all the properties in Theorems |2.3| and 
as well as the property that the mapping 

X^{X^,Vg{X)), 

is continuous. 



Remark 5.2. In the case det X > 0, that is when X is invertible, we have X^ = {X -^)^ = 
Vlndet(X). 

The importance of Theorem |5.1| comes from its significance in the study of a class of stochastic 
processes called the matrix-valued Bessel processes, introduced in |8 . These are Markov 
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processes whose infinitesimal generator arises as an extension of the differential operator 

£= iA+^(Vlndet(X),V), 

acting o n a suitable class of functions on M"^". Here (5 > is fixed. A consequence of 
is that any compactly supported Lipschitz continuous function on M"^" can be 

2 



5.1 



Theorem 

approximated uniformly by some compactly supported function g that satisfies a Neumann 
boundary condition, and has the crucial property that Cg is bounded (in fact, the boundedness 
of Lg is the incremental benefit that motivates Theorem |5.1| ) For such functions g, it is possible 
to obtain an integration-by-parts formula, 

h{X)Cg(X)m{dX) ^ ~\ 

where m{dX) = {detX^-^dX is a Radon measure on M"^", and h varies over some class of 
test functions. The right-hand-side of the above equation extends to a Dirichlet form £{g, h) on 
L^(M"^", m). This allows one to apply standard existence results for the associated stochastic 
process, which we denote by X = (Xf : t > 0). Furthermore, for a function g that satisfies the 
integration-by-parts formula, and for which Cg is bounded, the process 



{Vh{X),Vg{X))m{dX), 



5(Xt)-5(Xo)- 



Cg{'Xs)ds, t > 0, 



is a martingale. Since, by Theorem |5.1[ this holds for a large class of functions g, it is possible 
to obtain a good description of the probability law of X. In particular, powerful uniqueness 
results become available. For a detailed discussion, see [sl. 



After this brief digression, let us proceed to the proof of Theorem 5.1 The following lemma 
establishes a key property of the Moore-Penrose inverse. 

Lemma 5.3 (Projection of the Moore-Penrose inverse). Consider a stratum M g A+ and 
a matrix X g M"'^" that when projected onto M yields a single matrix Y. Then the equahty 

Pt^m{X^)^Y^ holds. 

Proof. Let k be the rank of the matrices in M, and note that since Pm{X) is a singleton, 
the inequality ikX > k must hold. Let X = UT,V^ be a singular value decomposition of X, 
and let E/c be the diagonal matrix obtained from S by setting all but the fc la rgest entries to 



zero. Then we have the equality Y ~ C/EfeT^^, as is apparent from Lemma 4.5 Observe, 

x=F = r=F _^ - y=F) = c/E+F^ + [/ (E+ - E+) v^. 

The desired result follows once we show that the right-hand-side is the decomposition of X^ 
as a direct sum in TyM ® NyM. To do this, first note that for small t, the matrix 

Y + tUY.+V^ = C/(Efe + tE+)y^, 

has rank fc, and hence lies in A/. Consequently the inclusion [/E^y^ e TyM holds. Next, for 
small t we have 

Pm {Y + tU (E+ - E+) V^) ^ Pm {U (Efc + t(E+ - E+)) V'^) = F, 
thus verifying that the matrix U (E+ — E^) lies in NyM, as required. □ 

Next, we establish the validity of the following inductive step. We will use the well-known 
property that the Moore-Penrose inverse mapping X H' X^ is continuous on each stratum M 
of A+. 
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Proposition 5.4. Consider the setup and notation given in Subsection 3.3 witli A = A-\ 
Assume that the following properties hold. 



3.6 



(i) Us is contained in the set U from Lemma . 

(ii) / is smooth, and we have V/(X) e TxT for every X eT, 

(iii) each stratum L G A+ with L CT has a neighborhood V such that 

f{X) = foPL{X) foraU X e V, 

(iv) the map X i-^ {'Vf{X),X^) is continuous at every X gT. 
Define a function g = <j)f o P^,/ + (1 — </>)/. Then the mapping 

X^{Vg{X),X^), 

is continuous at every X e F U M. 



Proof First note that, by Proposition 3.5 the function g is again C""^ smooth, with V/ = 
on F. Consider a matrix X e M, and write Y — Pm{X) as usual for X near M . Since we have 
g = f ° Pm near M, we deduce 

(V5(X),XT) = (V/(r),DPM(^)[^^]). 



Using Lemma 5.3 and the observation 'DPm{X)[U] = for J7 G NyM, we obtain 

BPm{X)[X^] = BPm{X)[Pt^m{X^)] = BPm{X)[Y^]. 

From the convergence X ^ X, we deduce Y ^ X, and consequently Y^ — ^ X^ , since the 
matrices Y and X have the same rank. We successively conclude, 

lim_(Vg(X),XT) = (V/(X),DPm(X)[X^]) 

= (V(/oPm)(Z),Z^) 
= {Vg(X),X^), 

thereby verifying continuity at X . Consider now a stratum L C F and a point X L. Since we 
have V/ — Vg outside Us, we can assume X gUs- We obtain 

{Vg{X),X^) - {Vf{X),X^) = 0(X)(V(/o Pm)(X) - Vf{X),X^) (5.1) 

+ (/oPm(X)-/(X))(V0(X),XT). 

Observe that whenever X is close enough to X, we have 

/ o Pm{X) = / o Pi o Pm{X) = f o P^iX) = f{X), 

due to the assumption on / and normal flatness of A+. Consequently, the right-hand-side of 



(5.1) vanishes as soon as X gets sufficiently close to X. The result follows. □ 



Proof of Theorem \5.1\ We simply carry out the same iterative procedure as in the proofs 

in 



of Theorems 2.3 and |2.7| (see Subsection 3.4), while in addition applying Proposition 5.4 



each step. Note that each time this is done, the hypotheses {ii) and {iii) will be satisfied since 



Propositions 3.5 and 3.7 were applied in some earlier iteration (initially these hypotheses are 
vacuously true.) The hypothesis {i) is easily achieved by shrinking Us if necessary, and {iv) 
holds due to earlier applications of Proposition |5.4| 
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